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THE  AM3ERS0N-DARLING  STATISTIC 


1.  Introduction. 

The  Anderson  Darling  Statistic  is  a  member  of  the  group  of 

Goodness-of-Fit  statistics  which  has  come  to  be  known  as  EDF  statistics 

(Stephens,  1974)  because  they  are  based  on  a  comparison  of  the  empirical 

distribution  function  of  a  given  sample  with  the  theoretical  distribution 

to  be  tested.  It  is  designed  to  test  that  random  variable  X  has  a  continuous 

cumulative  distribution  F(xj©)j  6  is  a  vector  of  one  or  more  parameters 

entering  into  the  distribution  function.  Thus  for  the  normal  distribution, 

2 

the  vector  Q  =  (p,a  ). 

The  empirical  distribution  function  (EDF)  is  defined  as 

number  of  sample  values  <,  x 
n  * 


F  (x)  = 
n v 


where  the  n  values 
of  X.  From  the  x^ 
in  ascending  order. 


x^ ,  x2,  . . . ,  xn  are  assumed  to  be  a  random  sample 
(l)-’  X(2)’  X(n)  order  statistics. 


let  x 


Fn(x)  is  then  defined  by 


F  (x)  =  0  ,  x  <  x 

n  (1) 

F  (x)  =  i/n  ,  x  .  —  x  <  x , .  . . ,  i=  l,...,(n-l) 

n  (i)  d+l) 

F  (x)  =  1  ,  x  <  x  . 

n  (n) 


Since  F  (x)  gives  the  proportion  of  a  random  sample  <  x  ,  one  might  expect 
n 

it  to  give  a  good  estimate  of  F(x;0),  which  is  the  probability  of  X  less 
than  x,  and  F  (x)  is  in  fact  a  consistent  estimator.  It  is  therefore 
natural  to  test  whether  the  sample  appears  to  come  from  F(x;e)  by  using 
a  statistic  based  on  the  discrepancy  between  ^n(x)  and  F(xj©). 
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Many  statistics  of  this  type  have  been  proposed,  the  most  famous,  and 
one  of  the  oldest,  being  the  Kolmogorov  statistic  D.  This  statistic 
is  based  on  the  largest  vertical  discrepancy  between  the  two  functions. 
An  alternative  measure  is  the  Cramer-von  Mises  family,  based  on  the 
squared  integral  of  the  difference  between  the  EDF  and  the  distribution 
tested: 


*  f  ■  2 

W  =  I  {Fn(x)  -  F(x;0)}  \|r(x)  dx  ;  (l) 

J—  00 

the  function  i|r(x)  gives  a  weighting  to  the  squared  difference.  One 

■Jf  -«•  ^ 

member  of  W  is  the  Cramer-von  Mises  statistic  itself,  W  with 
i|r(x)  =  1. 

The  Anderson  Darling  statistic,  the  subject  of  this  article,  is 
W*  with 


i|r(x)  =  [  {F(x; 6 ))  (l  -  F(x; 6 )}  ] ~  . 

This  weight  function  counteracts  the  fact  that  the  discrepancy 
between  Fn(x)  and  F(xj@)  is  necessarily  becoming  smaller  in  the  tails, 
since  both  approach  0  and  1  at  the  extremes.  The  weight  function 
given  weights  the  discrepancy  by  a  factor  inversely  proportional  to  its 
variance,  and  has  the  effect  of  giving  greater  importance  to  observations 
in  the  tail  than  do  most  of  the  EDF  statistics.  Since  tests  of  fit  are  often 
needed  implicitly  or  explicitly  to  guard  against  wayward  observations  in  the 
tails,  the  statistic  is  a  recommended  one,  with,  as  we  shall  see,  generally 
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good  power  properties  over  a  wide  range  of  alternative  distributions  when 
F(x;0 )  is  not  the  true  distribution. 


2.  Computing  Formula. 

For  practical  purposes,  the  definition  of  the  Anderson-Darling 
statistic  given  above  needs  to  be  turned  to  a  computational  formula. 
This  is  done  in  the  following  sequence  of  steps: 


(a)  Calculate  z^  =  F  (x^y@),  i  =  l,...,n 


(b)  The  Anderson-Darling  statistic  is  given  by 
p  n 

A  =  -{  £  (2i-l)[ln  z  +ln(l-zn+1_i)]}/n-  n  . 
i=l  ~  "  J 


(2) 


Note  that  since  the  x^  are  in  ascending  order,  the  will 

also  be  in  ascending  order,  though  the  usual  notation  of  order  statistics 
has  been  omitted. 


3 •  Goodness-of-fit  test  for  a  completely  specified  continuous  distribution. 

The  formula  for  z^  above  assumes  that  the  tested  distribution  F(x;0) 

is  completely  specified,  i.e.,  the  parameters  in  0  must  be  known.  When 

2 

this  is  the  case  we  describe  the  situation  as  Case  0.  The  statistic  A 

was  introduced  by  Anderson  and  Darling  (1952,  195^- )>  and  for  Case  0  they 

gave  the  asymptotic  distribution  and  tables  of  percentage  points.  For 

2 

testing  purposes  the  upper  tail  of  A  will  be  used;  large  discrepancies 
between  the  EDF  and  the  tested  distribution  will  indicate  a  bad  fit. 

Later,  Lewis  (1961)  demonstrated  that  the  distribution  of  A  for  a  finite 
sample  approaches  the  asymptotic  distribution  extremely  quickly,  so  that 
for  practical  purposes  only  the  asymyptotic  distribution  is  required  for 


3 


sample  sizes  greater  than  5.  A  table  of  percentage  points  is  given  in 

2 

Table  1.  To  mate  the  goodness  of  fit  test,  A  is  calculated  as  in 
Equation  (2)  above,  and  compared  with  these  percentage  points; 

the  null  hypotheses  that  random  variable  X  has  the  distribution  F(x;0) 

2 

is  rejected  at  level  a  if  A  exceeds  the  appropriate  percentage 
point  at  this  level. 

A.  Asymptotic  theory  of  the  Anderson-Darling  statistic. 

2 

The  distribution  of  A  for  Case  0  is  the  same  for  all  distributions 
tested.  This  is  because  the  probability  integral  transformation  is  made 

at  step  (a)  and  the  values  of  z^  are  ordered  values  from  a  uniform  distri- 

2 

bution  with  limits  0  and  1.  A  is  therefore  a  function  of  ordered 
uniform  random  variables.  The  asymptotic  distribution  theory  for  this 
special  case  can  be  found  from  the  asymptotic  theory  of  the  EDF,  or  more 
specifically  of  the  function 

yn(z)  =7n(Fn(z)-z)  , 

where  F  (z)  is  the  EDF  of  n  uniform  random  variables  as  above.  For  a 
modern  treatment  of  the  empirical  process  given  by  yn(z)  &ee  Durbin  (1973 a, 
1973b).  When  0  contains  unknown  components,  the  z^  given  by  the  transfor- 

nation  (a)  above,  when  an  estimate  0  replaces  0,  will  not  be  ordered  uniform 

2 

random  variables  and  the  distribution  theory  of  A  ,  as  for  all  other 

EDF  statistics,  becomes  substantially  more  difficult.  In  general,  the 

2 

distribution  of  A  ,  and  of  other  EDF  statistics,  will  depend  on  n  and 
also  on  the  values  of  the  unknown  parameters. 


k 


Fortunately,  an  important  simplification  occurs  if  unknown  components 
of  6  are  location  and  scale  parameters  onlyj  then  the  distribution  of 
each  EDF  statistic,  with  an  appropriate  estimate  for  e,  will  depend  on 
the  distribution  tested,  but  not  on  the  specific  values  of  the  unknown 

parameters.  Thus  for  the  test  for  the  normal  distribution,  for  example, 

2 

with  unknown  p  and  cr  ,  only  one  set  of  tables  would  be  needed,  of 

percentage  points  for  each  n.  This  simplification  makes  it  worthwhile 

2 

to  calculate  the  asymptotic  theory  for  A  and  other  EDF  statistics, 
and  this  has  been  done  for  the  normal  case  and  the  exponential  case  by 
Stephens  1976)  and  Durbin,  Knott  and  Taylor  (1975)*  Stephens, 

from  Monte  Carlo  studies  to  find  the  distributions  of  EDF  statistics 
for  finite  n,  has  calculated  modifications  of  the  basic  statistics j 
these  are  functions  of  the  statistic  and  of  n  which  can  be  used  with 
only  the  asymptotic  percentage  points.  Thus  only  one  line  of  percent¬ 
age  points  is  needed  to  make  the  test.  The  technique  is  set  out 
below.  Stephens  (1977 >  1978)  has  also  done  similar  distribution  theory 
to  provide  tests  for  the  extreme  value  distribution  (these  can  be  used 
for  the  Weibull  distribution  also)  and  for  the  logistic  distribution. 
Pettitt  and  Stephens  (1976)  have  given  tests  for  the  Gamma  distribution 
with  unknown  scale  parameter  but  known  shape  parameter.  From  all  these 
results,  we  set  out  the  technique  for  goodness -of-fit  testing  as  it 
applies  to  . 

5*  General  procedure  for  any  distribution  with  unknown  location  or 
scale  parameter. 

The  first  step  in  testing  goodness -of-fit  for  any  of  these  distribu¬ 
tions  is  to  estimate  the  unknown  parameters.  This  should  be  done  by 
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maximum  likelihood,  for  the  modifications  and  asymptotic  theory  to  hold. 

A 

Suppose  that  9  is  the  vector  of  parameters,  with  any  unknown  parameters 
estimated  as  above.  The  steps  continue  as  follows: 

(a)  Calculate  z^  =  F(x^j}6),  i  =  l,...,n  . 

2 

(b)  Calculate  A  from  the  formula  (2)  above. 

2 

(c)  Modify  A  by  the  formula  in  the  appropriate  table  below,  and 
compare  with  the  line  of  percentage  points  given. 


6.  Tests  for  different  distributions. 

It  is  worthwhile  to  set  forth  the  practical  details  of  these  calcula¬ 
tions,  for  each  distribution  separately. 

6.1.  Tests  for  the  normal  distribution.  We  here  distinguish  three  cases: 

Case  1:  The  mean  p  is  unknown  and  is  estimated  by  x,  but  a2  is  known} 
Case  2:  The  mean  p  is  known,  and  cr  is  estimated  by  E^(x^-p)  /n(=  s^,  say) 
Case  3-  Both  parameters  are  unknown  and  are  estimated  by  x  and 
s2  =  £.  (x.  -x)2/  (n-l). 

l  i  '  '  v  ' 


For  these  cases  the  calculation  of  z^  is  done  in  two  stages.  First 
w.  is  found  from 

l 


x(i)-x 

w.  =  — '  — - 

1  cr 


(Case  l); 


X/.  N'd 


w.  = 

l 


Ill 


(Case  2)}  w. 


X(i)"X 


(Case  3 )} 


then  z^  is  the  cumulative  probability  of  a  standard  normal  distribution, 

to  the  value  w^,  found  from  tables  or  computer  routines.  The  value  of 
2 

A  is  then  calculated  as  described  in  Section  2.  To  make  the  test  use  the 
modification  and  percentage  points  given  in  Table  2,  for  the  appropriate  case. 
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Illustration.  Hie  following  value  of  men's  weights  in  pounds,  first 


given  by  Snedecor,  were  used  by  Shapiro  and  Wilk  [17]  as  an  illustration 
of  a  test  for  normality:  lM3,  154,  158;  160,  l6l,  162,  166,  170,  182, 

195;  236.  The  mean  is  172  and  the  standard  deviation  24. 95-  For  a  test 
for  normality  (Case  3);  the  values  of  w^  begin  w^  =  (148-172 )/2h-. 95  = 

-O.982,  and  the  corresponding  z is,  from  tables,  O.168.  When  all 

2 

the  z^  have  been  found,  the  formula  in  Section  2  gives  A  =  0.947* 

Now  to  make  the  test,  the  modification  in  Table  2  first  gives 

A*  =  A2(1.0  +0.75/11.0  +  2.25/121.0)  =  A2  (1.0868)  =  1.029  ; 

when  this  value  is  compared  with  the  percentage  points  in  Table  2,  for  Case  3; 
the  sample  is  seen  to  be  significant  at  approximately  the  1  percent  level. 


6.2.  Tests  for  the  exponential  distribution.  The  distribution  tested 
is  F(x)  =  1-exp  (l-x/P),  x  >  0,  inscribed  as  Exp(x,P),  with  P  an 


unknown  positive  constant.  Maximum  likelihood  gives  P  =  x,  so  that  z^ 

are  found  from  z^  =  1-exp (-x^/x),  i  =  l,...,n.  A  is  calculated  as 

*  * 

in  Section  2,  modified  to  give  A  by  the  formula  in  Table  3,  and  A 

is  compared  with  the  percentage  points  in  Table  3. 

For  the  more  general  exponential  distribution  given  by 
F(x)  =  1-exp (- (x-a )/p ),  x  >  a,  when  both  a  and  P  are  unknown,  a 
convenient  property  of  the  distribution  may  be  used  to  return  the  test 
situation  to  the  case  just  described  above.  The  distribution 
y^^  =x^+1^-x^  is  made,  for  i  =  1,  ...,n-lj  the  n-1  values 
of  y^  are  then  used  to  test  that  they  come  from  Exp(y;P)  as 
just  described.  The  substitution  to  y^  reduces  the  sample  size 
by  one,  but  eliminates  a  very  straightforwardly. 
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6.3*  Tests  for  the  extreme  value  distribution.  The  distribution  tested 


is  here  F(x;@)  =  exp[ -exp{ - (x-a )/P  )}  ] ,  (-00  <  x  <  co),  with  9  =  (a,P); 
a  and  P  are  constants,  P  positive.  As  for  the  normal  distribution 
we  distinguish  three  cases: 

Case  1:  p  is  known  and  a  is  estimated; 

Case  2:  a  is  known  and  P  is  estimated; 

Case  3:  a  and  P  are  both  unknown,  and  must  be  estimated. 

Maximum  likelihood  estimates  of  cl  and  P  are  given  by  solving 
equations : 


P 


A 

2.x./ n  -  fe.x  .exp(-x  ./ P  ))/{£  .exp(-x  . 
0  3  1  0  3  3  3 


/P)},  a  =  -P  log{Z.exp(-x  /p)/n}. 

J  tJ 


A 

The  first  equation  is  solved  iteratively,  and  then  cl  can  be  found. 

A  A 

Case  1,  P  is  known;  then  P  replaced  P  in  a.  In  case  2,  Q!  is 

A 


known;  suppose  then  that  y  =  x^-ct, P  is  given  by  solving 


In 


P  =  (2  y  -  Z  y  .exp(-y  /P)}/n  . 

tJ  tJ 

2 

These  are  then  used  in  F(x;0)  to  give  z^  and  hence  A  .  The 
modifications  and  percentage  points  for  the  different  Cases  are 
given  in  Table  k. 

S.b.  Tests  for  the  Weibull  distribution.  The  distribution  tested,  in  its 
most  general  form,  is 
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(3) 


F(xj0)  =  1  -  exp[-{  (x-oO/P}^],  (x>a), 

•with  0  =  (Q!,P ,/ );  P  and  y  must  be  positive.  Mien  a  is  known, 
the  substitution  Y  =  -ln(X-Ot)  gives,  for  the  distribution  function 
for  Y,  F(y)  =  exp[ -exp{  -  (y-O!'  )/P' }  ],  (y  >  a'  ),  where  P'  =  l/y 
and  oc'  =  -in  P,  so  that  Y  has  the  extreme  value  distribution 
considered  above.  A  test  for  the  Weibull  distribution,  with  oc  known 
but  P,  7  unknown  therefore  can  be  made  as  follows: 

(a)  Find  y^j  =  -inCx^^^  ^-cc)  ,  i  =  l,..,,n. 

(b)  Test  that  y  ^  is  a  sample  (now  placed  is  ascending  order  by 

step  (a))  from  the  extreme  value  distribution  with  two  unknown  parameters, 
as  described  in  Section  6.3,  Case  3. 

Note  also  that  if,  in  addition  to  OC,  y  is  known  in  (3),  the 
substitution  Y  =  -ln(X-C£)  gives  an  extreme-value  distribution  for  Y 
with  scale  parameter  P1  now  known  (Case  1  of  Section  6.3);  if  a  and 
P  are  both  known  in  (3),  the  substitution  gives  an  extreme -value 
distribution  for  Y  with  location  parameter  02  known  (Case  2  of 
Section  6.3 ) 

6.5*  Tests  for  the  logistic  distribution.  The  distribution  tested  is 
F(x;0)  =  [1  +  exp{ - (x-a )/p }  ]_1,  (x  >  a),  with  0  =  (a,P);  a,p  are 
constants,  with  P  positive.  Again  t  hree  cases  are  distinguished 
(Stephens,  1979 ): 

Case  1:  P  is  known,  and  oc  must  be  estimated; 

Case  2:  a  is  known,  and  P  must  be  estimated; 

Case  3-  Both  oc  and  P  are  unknown  and  must  be  estimated. 
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Maximum  likelihood  estimates  are  given  for  Case  3  by  the  equations: 


21  [1  +  exp{  (xi-o:)/P}  ]  1  =  n/2  } 


x.-a  1  -  exp{  (x.  -a)/P] 

=  -n 

P  1  +  exp{  (xi-o:)/p} 


These  may  be  solved  iteratively,  using,  for  example,  x  and  S-/3  /« 
as  starting  estimates  of  &  and  P.  In  Case  1,  only  the  first  equation 


A 

is  needed,  with  P  replacing  P,  and  in  Case  2  only  the  second 

A 

equation  is  used,  with  ot  replacing  a.  In  the  transformation 


2 

and  A  is  calculated  from  the  formula  (2).  The  modification  to 
A  ,  and  the  percentage  points  of  A*  are  given  in  Table  3. 


6.6.  Tests  for  the  Gamma  distribution  with  known  shape  parameter. 

The  density  under  test  is  f(x;6)=  {r(m)pm)  1  xm  1  e  ,  x  >  0, 

and  the  distribution  is  F(xje)  =  fX  f(t}0)dt.  The  parameter  vector 

0 

6  =  (m,P)  contains  m  as  shape  parameter  and  P  as  scale  parameter} 
note  that  the  test  involving  an  unknown  location  parameter  is  not 
considered.  In  the  test  which  follow,  we  assume  m  is  known}  P  is 

A  —  _ 

then  estimated  by  P  =  m/x,  where  x  is  the  sample  mean.  Estimated 

density  f  (m3 0 )  is  f(m}6)  above  with  P  replaceing  P,  and  F(x}e) 

is  defined  in  a  similar  way.  Then  for  the  goodness-of-fit  test,  values 

z.  are  calculated  from  z.  =  F(x/.  \}0),  and  A^  calcuMed  as  in  Section  2. 
1  1  (1 ) 

The  modified  form  A  ,  and  tables  of  percentage  points  for  A  ,  are  given 
for  various  m  in  Table  6. 
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7.  Power  of  the  Anderson-Darling  Statistic. 


As  -was  described  in  the  introduction,  the  Anderson-Darling 
2 

statistic  A  gives  weight  to  observations  in  the  tails  of  the 

distribution  tested,  whereas  other  statistics  sometimes  have  the 

2 

effect  of  giving  less  importance  to  these  observations.  A  can 
therefore  be  expected  to  be  powerful  in  detecting  alternatives  which 
have  high  probability  of  giving  observations  in  the  tails.  Several 
studies  have  been  made,  for  example,  on  tests  for  the  uniform  distri¬ 
bution  with  limits  at  0  and  1.  Phis  is  the  distribution  of  the  z. 

l 

in  Section  2,  in  the  Case  0  situation  where  the  tested  distribution 

is  completely  specified.  The  type  of  alternative  to  uniformity  generally 

considered  has  distribution  function  F(x)  =  x  or  F(x)  =  l-(l-x)  , 

0  <  x  <  1  with  k  >  0.  These  distributions  produce  points  which  are 

close  to  1  or  close  to  0  respectively.  Simple  modifications  of 

the  distributions  will  produce  densities  with  a  peak  at  0.5,  or  with 

the  minimum  valueat  0.5  and  high  values  in  either  tail.  There  is 

generally  a  clear  difference  in  behavior  of  EDF  statistics  in  detecting 

these  alternatives.  (Stephens,  197*4-;  Quesenberry  and  Miller,  1977; 

2 

Locke  and  Spurrier,  1978).  The  statistic  A  will  detect  alternatives 
which  produce  observations  towards  0  or  1,  but  other  statistics  of 
the  EDF  class  are  more  suitable  for  alternatives  which  produce  a  cluster 
near  0.5  (Stephens,  197*0- 

In  the  important  situations  where  parameters  must  be  estimated,  the 
differences  in  powers  of  the  EDF  statistic  appear  to  level  out;  the 
opportunity  to  estimate  parameters  means  that  F(x;6J  is  brought  close 
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to  the  EDF  of  the  sample,  and  the  z^  values  of  Section  2  are  super - 

uniform  i.e.,  more  regular  than  a  genuine  uniform  sample.  Even  with  an 

alternative  distribution  to  the  null,  the  z^  do  not  take  very  extreme 

departures  from  uniformity.  In  these  circumstances  the  powers  of  the 

various  EDF  statistics  are  not  so  different  among  themselves  as  they 

are  for  the  Case  0  situation;  see,  e.g. ,  tests  for  normality  reported 

in  Stephens  (1974).  Nevertheless,  because  of  the  importance  it  gives 

2 

observations  in  the  tails,  A  appears  overall  to  be  an  effective  EDF 

statistics  in  these  situations.  It  compares  very  favorably  with 

statistics  also  devised  for  testing  for  special  distributions,  e.g.  the 

Shapiro-Wilk  statistics  for  testing  normality,  or  exponential! ty 

(Shapiro  and  Wilk,  1965,  1972;  Stephens,  1974,  1978).  The  statistic 
2 

A  has  the  merit  of  being  easy  to  calculate,  and,  using  the  modifica¬ 
tions,  easy  to  apply  with  only  one  line  of  percentage  points  for  each 

test  situation. 

8.  Related  Topics. 

p 

This  article  has  been  concerned  with  the  use  of  A  for  testing 
goodness -of -fit  of  one  sample,  for  a  variety  of  distributions.  There 
occur  problems  in  which  several  samples,  usually  of  small  size,  are 
available,  and  one  wishes  to  combine  the  information  in  those  samples 
to  make  an  overall  goodness -of -fit  test.  Pettitt  (1977)  has  provided 
tables  from  which  one  can  obtain  the  significance  level  p. ,  i  =  l,...,k, 

of  k  such  tests,  in  the  Case  3  situation  of  a  test  for  the  normal 

distribution  (Section  6.1  above).  The  values  pi  are  then  combined 
using  Fisher’s  well  known  method. 
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Pettitt  ( 1976)  has  also  given  a  two  sample  version  of  the  Anderson- 
Darling  statistic;  like  the  two  sample  versions  of  other  EDF  statistics, 
it  is  essentially  a  rank  test.  Pettitt  adds  some  asymptotic  power 
comparisons  with  other  two  sample  rank  tests,  which  show  that  A2 
compares  very  favorably. 
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been  calculated.  The  percentage  points  given  can  be  used  with  unmodified 
for  n  >  20. 
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Modified  form  A*  and  upper  tail  percentage  points  for  a  test  for  the  gamma 
distribution  with  known  shape  parameter  m  and  unknown  scale  parameter,, 
Section  6.6. 
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